Deep-reinforcement-learning-based gait pattern controller on an uneven terrain for humanoid robots
نویسندگان
چکیده
Although conventional gait pattern control in humanoid robots is typically performed on flat terrains, the roads that people walk every day have bumps and potholes. Therefore, to make more similar humans, movement parameters of these should be modified allow them adapt uneven terrains. In this study, solve problem, reinforcement learning (RL) was used engage self-training automatically adjust their for ultimate control. However, RL has multiple types, each type its own benefits shortcomings. a series experiments were performed, results indicated proximal policy optimization (PPO), combining advantage actor-critic trust region optimization, most suitable method. Hence, an improved version PPO, called PPO2, used, experimental combination deep with data preprocessing methods, such as wavelet transform fuzzification, facilitated balance robots.
منابع مشابه
Online Learning of Uneven Terrain for Humanoid Bipedal Walking
We present a novel method to control a biped humanoid robot to walk on unknown inclined terrains, using an online learning algorithm to estimate in real-time the local terrain from proprioceptive and inertial sensors. Compliant controllers for the ankle joints are used to actively probe the surrounding surface, and the measured sensor data are combined to explicitly learn the global inclination...
متن کاملOperation Scheduling of MGs Based on Deep Reinforcement Learning Algorithm
: In this paper, the operation scheduling of Microgrids (MGs), including Distributed Energy Resources (DERs) and Energy Storage Systems (ESSs), is proposed using a Deep Reinforcement Learning (DRL) based approach. Due to the dynamic characteristic of the problem, it firstly is formulated as a Markov Decision Process (MDP). Next, Deep Deterministic Policy Gradient (DDPG) algorithm is presented t...
متن کاملAutonomous Reinforcement Learning with Experience Replay for Humanoid Gait Optimization
This paper demonstrates application of Reinforcement Learning to optimization of control of a complex system in realistic setting that requires efficiency and autonomy of the learning algorithm. Namely, Actor-Critic with experience replay (which addresses efficiency), and the Fixed Point method for step-size estimation (which addresses autonomy) is applied here to approximately optimize humanoi...
متن کاملWalking Pattern Generation on Inclined and Uneven Terrains for Humanoid Robots
This paper introduces a walking pattern generation method on an inclined terrain in both pitch and roll directions, and uneven terrain. The walking pattern generation method is based on a modifiable walking pattern generator (MWPG) which allows a zero moment point (ZMP) variation in real-time. As a navigational command set, a 3-D command state (CS) is defined, which consists of single and doubl...
متن کاملTraining an Interactive Humanoid Robot Using Multimodal Deep Reinforcement Learning
Training robots to perceive, act and communicate using multiple modalities still represents a challenging problem, particularly if robots are expected to learn efficiently from small sets of example interactions. We describe a learning approach as a step in this direction, where we teach a humanoid robot how to play the game of noughts and crosses. Given that multiple multimodal skills can be t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Optomechatronics
سال: 2023
ISSN: ['1559-9612', '1559-9620']
DOI: https://doi.org/10.1080/15599612.2023.2222146